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Abstract 

Results on two-particle angular correlations for charged particles emitted in proton- 
proton collisions at center-of-mass energies of 0.9, 2.36, and 7 TeV are presented, using 
data collected with the CMS detector over a broad range of pseudorapidity (n) and 
azimuthal angle ((p). Short-range correlations in An, which are studied in minimum 
bias events, are characterized using a simple "independent cluster " parametrization 
in order to quantify their strength (cluster size) and their extent in n (cluster decay 
width). Long-range azimuthal correlations are studied differentially as a function of 
charged particle multiplicity and particle transverse momentum using a 980 nb _1 
data set at 7 TeV. In high multiplicity events, a pronounced structure emerges in 
the two-dimensional correlation function for particle pairs with intermediate pj of 
1-3 GeV/c, 2.0 < \An\ < 4.8 and A<p ?»0. This is the first observation of such a long- 
range, near-side feature in two-particle correlation functions in pp or pp collisions. 
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1 Introduction 

This paper presents measurements of two-particle angular correlations of charged particles 
emitted in proton-proton (pp) collisions at center of mass energies (y^s) of 0.9, 2.36, and 7 TeV. 
This first study of short- and long-range correlations in pp collisions at the LHC high energy 
frontier provides important information for characterizing Quantum Chromodynamics (QCD) 
in this energy regime, especially the mechanism of hadronization and possible collective ef- 
fects due to the high particle densities reached in these collisions. Multiparticle correlations in 
high energy collisions have been measured previously for a broad range of collision energies 
and colliding systems with the goal of understanding the underlying mechanism of particle 
production |[THZl- 

Two related studies of angular correlations have been performed using two-dimensional At]-A(p 
correlation functions. Here At] is the difference in pseudorapidity rj (=— ln(tan(#/2)), where 6 
is the polar angle relative to the beam axis) between the two particles and A(p is the difference 
in their azimuthal angle (p (in radians). In a first analysis, pp data collected with a minimum 
bias trigger at 0.9, 2.36, and 7 TeV were used to study short-range correlations (\At] \ less than 
!%2). In a second study, the long-range structure (2.0 < \At]\ < 4.8) of two-particle correlation 
functions was examined as a function of charged particle multiplicity and particle transverse 
momentum for a large data set at 7 TeV. 

In short-range correlations in minimum bias events, a peak with a typical width of about one 
unit in Arj is observed. A useful way to quantify this effect is to assume that the initial inter- 
actions emit so-called "independent clusters", which subsequently decay isotropically in their 
own rest frame into the observed hadrons EHS EHZl - This simple independent cluster model 
(ICM) parametrization of the observed correlation function allows a quantitative comparison 
of data and models for different collision energies and collision systems. The observed corre- 
lation strength and extent in relative pseudorapidity between the particles are parametrized 
by a Gaussian distribution. The fitted parameters in this ansatz are the cluster multiplicity or 
"size" (the average number of particles into which a cluster decays) and the decay "width" 
(the spread of the daughter particles in pseudorapidity). This ansatz is only a phenomenolog- 
ical parametrization which provides no insight as to the nature of the assumed clusters nor to 
the mechanisms by which clusters are formed. Relating these results to the underlying QCD 
dynamics requires further modeling. 

To investigate long-range azimuthal correlations (2.0 < |A?/| < 4.8), a high-statistics data set 
of high multiplicity pp events at 7 TeV was used. In current pp Monte Carlo (MC) event gen- 
erators, the typical sources of such long-range correlations are momentum conservation and 
away-side (A<^ n) jet correlations. Measurements at the Relativistic Heavy Ion Collider 
(RHIC) have revealed that the long-range structure of two-particle angular correlation func- 
tions is significantly modified by the presence of the hot and dense matter formed in relativis- 
tic heavy ion collisions [7J. Several novel correlation structures over large Arj were observed in 
azimuthal correlations for intermediate particle transverse momenta, pj « 1 — 5 GeV/c H8j|9|. 
Since the particle densities produced in the highest multiplicity pp collisions at LHC energies 
begin to approach those in high energy collisions of relatively small nuclei such as copper |[T0|, 
it is natural to search for the possible emergence of new features in the two-particle correla- 
tion function from high multiplicity pp events JTTHIH1 - Therefore, the azimuthal (Acp) correla- 
tion functions from the large data set at 7 TeV have been studied differentially by binning the 
events in the observed charged particle multiplicity and by selecting particle pairs in bins of 
the transverse momentum of the particles. 

The paper is organized as follows: the experimental setup, event triggering, and event selec- 
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2 Experimental Setup, Triggering, and Event Selections 



tion for both analyses are described in Section [2] Criteria used to select tracks are listed in 
Section [3] The general procedure for calculating the correlation functions and the CMS-specific 
efficiency corrections are described in Sections [4] and |5} respectively. Results for the analysis of 
short-range correlations in minimum bias data using the cluster parametrization are given in 
Section |6j The study of long-range correlations as a function of event multiplicity and particle 
transverse momentum is detailed in Section IT] 

2 Experimental Setup, Triggering, and Event Selections 

This analysis used three data sets collected with pp interactions at y/s = 0.9, 2.36, and 7 TeV. A 
detailed description of the CMS experiment can be found in Ref. 1 19]. The detector subsystems 
used for the present analysis are the pixel and silicon-strip tracker (SST), covering the region 
| rj\ < 2.5 and immersed in a 3.8 T axial magnetic field. The lead tungstate crystal electromag- 
netic calorimeter (ECAL), the brass/ scintillator hadron calorimeter (HCAL), and the forward 
calorimeter (HF, covering the region 2.9 < \tj\ < 5.2), were also used for online and offline event 
selections. The detailed MC simulation of the CMS detector response is based on GEANT4 ]20J. 

Any hit in the beam scintillator counters (BSC, 3.23 < \rj\ < 4.65) coinciding with colliding 
proton bunches was used for triggering the data acquisition in the minimum bias trigger. To 
preferentially select non-single-diffractive (NSD) events, a coincidence of at least one HF calor- 
imeter tower with more than 3 GeV of total energy on each of the positive and negative sides 
was required. Events were also required to contain at least one reconstructed primary vertex 
(PV) that fell within a 4.5 cm of the nominal collision point along the beam axis and within 
a radius of 0.15 cm measured perpendicular to the beam relative to the average vertex posi- 
tion, and to contain at least three fully reconstructed tracks associated with the primary ver- 
tex. Outside this relatively narrow vertex range, the density of events was too small to ensure 
enough statistics for constructing the random background distribution (Section |4]| in small bins 
of the longitudinal (z) vertex position. Beam-halo and other beam-background events were 
rejected as described in Ref. [21J. The contamination of background events after selections in 
the colliding-bunch data sample was found to be negligible (< 0.1%). 

After all selections are applied, the total number of events used for the minimum bias analysis 
of cluster properties described in Section g is 168 854 (3.3 jib' 1 ) for 0.9 TeV, 10902 (0.2 ^b^ 1 ) 
for 2.36 TeV, and 150086 (3.0 }ib~ l ) for 7 TeV, where the numbers in parentheses are the ap- 
proximate integrated luminosity for the individual data samples. The systematic uncertainties 
in the results shown in Section [6] significantly exceed the statistical uncertainties for the 150k 
event minimum bias data sample at 7 TeV, so no further events were included in this analysis. 

In order to investigate the properties of the high multiplicity pp collisions, a dedicated high 
multiplicity trigger was designed and implemented in the two levels of the CMS trigger system. 
At Level 1 (LI), the total transverse energy summed over the entire set of CMS calorimeters 
(ECAL, HCAL, and HF) was required to be greater than 60 GeV. At the high-level trigger (HLT), 
online tracks built from the three layers of pixel detectors with a track origin within a cylindrical 
region of 21 cm along the beam and 0.5 cm in the transverse radius were used in an online 
vertexing algorithm. The number of pixel tracks (N°^ me ) with \tj\ < 2, pj > 0.4GeV/c, and a 
distance of closest approach of 0.12 cm or less to the best vertex (the one associated with the 
highest number of tracks) was determined for each event. Data were taken with a threshold 
initially set to N°£ lme > 70. During later, higher-luminosity running, the lower limit was raised 
to 85. 

The total integrated luminosity for the high multiplicity analysis was 980 nb- 1 . The total num- 
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ber of events in each of the bins of offline reconstructed track multiplicity, N t °^ me , used in the 
analysis are listed in Table [l] To take advantage of the full acceptance of the CMS tracking 
system, N°^ me includes tracks within \t]\ < 2.4 (see Section [3] for other offline track selection 
criteria). The table also lists the average values of N°^ ine as well as the average of ]V t c r ° rrected , 
the event multiplicity corrected for all detector and algorithm inefficiencies, as described in 
Section |5l 

Table 1: Number of events for each multiplicity bin used in the 7 TeV analysis with total 
integrated luminosity of 980 nb _1 . The multiplicity of offline reconstructed tracks, N°^ me , was 
counted within the kinematic cuts of \r/\ < 2.4 and pj > 0.4GeV/c. The last two columns list 
the average values of N°^ me as well as the average of ]\f t c r ° rrected , the event multiplicity corrected 
for all detector and algorithm inefficiencies. 



Multiplicity bin (N t °/if ine ) 


Event Count 


(^offline) 


/ ^corrected ^ 


MinBias 


21.43M 


15.9 


17.8 


N trk 1116 < 35 


19.36M 


13.0 


14.1 


35 < Nff™ < 90 


2.02M 


45.3 


53.1 


90 < Ngf™ < 110 


302.5k 


96.6 


111.7 


^offline > no 


354.0k 


117.8 


136.1 



3 Track Selection 

In this analysis, the so-called CMS highPurity l|22|l tracks were used. Additionally, a recon- 
structed track was considered as a primary-track candidate if the significance of the separation 
along the beam axis, z, between the track and the primary vertex, d z /a(d z ), and the significance 
of the impact parameter relative to the primary vertex transverse to the beam, d xy / a(d xy ), were 
each less than 3. In order to remove tracks with potentially poorly reconstructed momentum 
values, the relative uncertainty of the momentum measurement, cr(pj) I pr, was required to be 
less than 10%. 

To ensure reasonable tracking efficiency and low fake rate, only tracks within \rj\ < 2.4 and 
above a minimum pj value were used. For the inclusive analysis, the selected range was 
0.1 GeV/c < pj < 5.0 GeV/c. The effect of the upper limit imposed on pj is negligible. The 
effects of the lower pj cut, as well as the effect of the t] restriction on the determination of 
cluster parameters from Arj correlations, are significant and will be discussed in more detail 
below. To avoid possible bias in the high multiplicity analysis, the lower cutoff was raised to 
pj > 0.4 GeV/c when classifying the event multiplicity in order to match the cut applied in the 
online tracking. 



4 Calculation of the Two-Particle Correlation Function 

For both minimum bias and high-multiplicity triggered collision events, the first step in ex- 
tracting the correlation function was to divide the sample into bins in track multiplicity. For the 
minimum bias sample, 10 bins were used, each containing about the same number of events. 
Following an approach similar to that in Refs. Pl|6), the pr-inclusive charged two-particle cor- 
relation as a function of Arj and A(p is defined as follows: 
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5 Corrections for Tracking and Event Selection Inefficiencies 



where Sjv and Bjv are the signal and random background distributions, defined in Eqs. ^ and 
^ respectively, At](= tj\ — J/2) and Acp(= cp\ — cpz) are the differences in pseudorapidity and 
azimuthal angle between the two particles, (N) is the number of tracks per event averaged over 
the multiplicity bin, and the final R(Ar/, Acp) is found by averaging over multiplicity bins. For 
simplicity in Eq. |T]) and the discussion in this section, N is used to represent the total number of 
offline reconstructed tracks per event. Note that the order in which the particles are considered 
has no significance. The quantities Arj and A(p are always taken to be positive and used to 
fill one quadrant of the Atj, A(p histograms with the other three quadrants filled by reflection. 
Therefore, the resulting distributions are symmetric about (A?/,A</>)=(0,0) by construction. 

For each multiplicity bin, the signal distribution: 

1 d 2 N si & nal 

S M (A^) = w(w _ 1)rfA>; ^ (2) 

was determined by counting all particle pairs within each event, using the weighting factor 
N(N — 1), then averaging over all events. This represents the charged two-particle pair density 
function normalized to unit integral. The background distribution: 

1 c?2Arnuxed 

denotes the distribution of uncorrelated particle pairs representing a product of two single- 
particle distributions, also normalized to unit integral. This distribution was constructed by 
randomly selecting two different events within the same multiplicity bin and pairing every 
particle from one event with every particle in the other (in this case, the normalization factor 
1/N 2 corresponds to I/N1N2 event-by-event). The pairs of events used to compute the back- 
ground were also required to be within the same 0.5 cm wide bin in the vertex location along 
the beam. 

As indicated in Eq. the ratio of Sn(At], Acp) to B^{Atj, Acp) was first calculated in each multi- 
plicity bin. Dividing the background in this way corrects for detector effects such as tracking in- 
efficiencies, non-uniform acceptance, etc. The ratio of signal to background was then weighted 
by the track multiplicity factor, (N) — 1 (where (N) is the average multiplicity in each bin), and 
averaged over all the multiplicity bins to arrive at the final two-particle correlation function 
R{Af],Acp). 



5 Corrections for Tracking and Event Selection Inefficiencies 
5.1 Correction for Tracking Inefficiency 

Studies with simulated events showed that the combined geometrical acceptance and recon- 
struction efficiency for the global track reconstruction exceeds 50% around pj ~ 0.1 GeV/c 
over the full CMS tracker acceptance (\rj\ < 2.4) for charged hadrons. The efficiency is greater 
than 90% in the \t]\ < 1 region for pj > 0.6 GeV/c. Detailed studies of tracking efficiencies 
using MC -based and data-based methods can be found in [23 j. The tracking efficiency cor- 
rection factor, e trk , was determined by taking the ratio of the number of reconstructed tracks 
(N trk ) to that of generator level primary charged particles (N gen ) in the simulated MC events as 
a function of pj, f], z vtx : 

trk/ \ N trk (//,p T ,2; v tx) 



5.2 Event Selection Correction for Minimum Bias Data 
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In constructing the signal and background distributions, this correction was applied as an in- 
verse weight, l/e trk (t], pT/Z v tx), to each particle. After this correction, the two-particle correla- 
tion function found using reconstructed tracks from simulated events matched that obtained 
at the generator level to within 1.4%. 

Using simulations, the tracking efficiency was found to have little or no dependence on mul- 
tiplicity within the range studied in the present work. The fake rate did increase slightly with 
multiplicity but remained at the 1-2% level. Therefore, the corrections applied for tracking 
efficiency and fake rate were independent of event multiplicity. 

5.2 Event Selection Correction for Minimum Bias Data 

For the minimum bias data, inefficiencies in triggering and vertex reconstruction of low multi- 
plicity events resulted in multiplicity distributions of reconstructed tracks which were biased 
toward higher average values. The correction factor for this effect, e evt , was determined by 
taking the ratio of two generator-level MC multiplicity distributions, one with offline event se- 
lection applied (N|^ Sel ) and one for all NSD MC events (NgjP). In Eq. j§|, N^ e represents the 
true number of particles in the event. The NSD event selection efficiency: 

ArevtSel/AjtrueN 

Wrk ) ~ N NSD (Nr) W 

is about 50% at N^ e =6 and reaches 100% around N^ e =15. When calculating the correlation 
function, each event was weighted by the inverse of the event selection efficiency evaluated at 
N t c r j, rrected which is the number of particles corrected for acceptance and tracking efficiency as 
described above, 1 / e evt (N^ iected ) . 

5.3 Event Selection Correction for High Multiplicity Data 

The two high-multiplicity trigger thresholds used in the HLT (see Section|2]| give different trig- 
ger efficiencies. Distributions for offline reconstructed track multiplicity, N°^ me , in minimum 
bias and high multiplicity triggered events at 7 TeV are shown in the top panels of Fig. [I] Kine- 
matic cuts of \y\ I < 2.4 and pj > 0.4 GeV/c were used in defining N^ e ( see Section 3 for other 
offline track selections). The statistics of events with N°^ me > 110 were enhanced by a factor 
of about 1000 with the high multiplicity trigger relative to the minimum bias trigger due the 
large prescale factor applied to the latter sample. The lower panels in Fig. [I] show the HLT ef- 
ficiency, obtained from data, of the two high multiplicity triggers relative to the minimum bias 
trigger. The LI triggering efficiency (not shown in Fig. [T]) is not a concern since it reaches 100% 
efficiency for events with N°^ ine > 90. A weight given by the inverse of the HLT efficiency, 
e^,f T (N°f lme ), was applied to all pairs from a given event. 



6 Short-Range Correlations in 0.9, 2.36, and 7 TeV Data 

The final two-particle inclusive correlation functions are shown in Fig.|2]as a function of At] and 
A<p at Vs = 0.9, 2.36, and 7 TeV. A small region with \ At]\ < 0.06 and \ A(p\ < 0.06 was excluded 
in both signal and background distributions in order to reject residual secondary effects (i.e., 
any tracks from photon conversions, weak decays, or ^-electrons which were not rejected by 
the cut on the projected distance of the track from the vertex). 

The complex two-dimensional (2-D) correlation structure shown in Fig.|2]is dominated by three 
prominent components: a narrow peak at (At],A(p)^(0,0) which can be understood as the con- 
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6 Short-Range Correlations in 0.9, 2.36, and 7 TeV Data 




Figure 1: Distributions of the number of tracks reconstructed in offline analysis, N?j* , for 
minimum bias events, as well as high-multiplicity triggered events, both at 7 TeV, with online 
multiplicity N°^ ine greater than (a) 70 and (b) 85. The total integrated luminosity of the data set 
is 980 nb- 1 . The minimum bias trigger was heavily prescaled during higher luminosity LHC 
running. The HLT efficiency turn-on curves for the two high multiplicity triggers are shown in 
the two panels at the bottom. 



(a) CMSVs = 0.9TeV (b) CMSVs = 2.36TeV (c) CMS\fi = 7TeV 




Figure 2: Two-particle correlation functions versus At] and Acp in pp collisions at y/s = (a) 0.9, 
(b) 2.36, and (c) 7 TeV. 

tribution from higher pj clusters (e.g., hard processes like jets); a ridge at Acp n spread over 
a broad range in At], interpreted as due to away-side jets or more generally momentum con- 
servation; and an approximately Gaussian ridge at At] f«0 extending over the whole range of 
Acp, becoming broader toward larger Acp values, which arises from the decay of clusters with 
lower pj (e.g., soft QCD string fragmentation). This broadening will be discussed in quantita- 
tive detail later in this section. The PHOBOS experiment at RHIC observed similar correlation 
structures in pp collisions at \/s = 200 GeV and 410 GeV [6|. Qualitatively similar structures 
also exist in PYTHIA (Fig. [3] for D6T tune |24l ) although they do not reproduce the strength 
of the correlations seen in the data. The qualitative features of the observed correlations in 
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(a) PYTHIAVs = 0.9TeV 




(b) PYTHIAVs = 2.36TeV 



-©- 
< 



(c) PYTHIAVs = 7TeV 





Figure 3: Two-particle correlation functions versus At] and Acp in PYTHIA D6T tune at y^s = 
(a) 0.9, (b) 2.36, and (c) 7 TeV. 

the data are also consistent with an independent cluster approach according to a simulation 
study from the ISR experiment using a low-mass resonance (p, to, tj) gas model [3| and a MC 
model of isotropic cluster decays from the PHOBOS experiment 0- Bose-Einstein Correla- 
tions (BEC, also known as the Hanbury-Brown and Twiss effect |25|) have been measured in 
pp collisions M26T28I but their influence on the extracted cluster parameters has been found to 
be negligible HJ. 

To quantify one aspect of the correlation structure, the 2-D correlation functions were reduced 
to one-dimensional (1-D) functions of At] by integrating Sn(Aj/, A<p) and Bn(Atj, Acp) over A(p: 

J S N {Ar],A<p)dA<p 



R(A V ) = (((N)-1) 



J B N (Ari,A<p)dA<p 



(6) 



bins 





(c) CMSVs = 7TeV 0<|A<|>|<7f 




Figure 4: Two-particle pseudorapidity correlation function, obtained by averaging over the 
entire A(p range from to n, in pp collisions at = (a) 0.9, (b) 2.36, and (c) 7 TeV. The solid 
curves correspond to the fits by the cluster model using Eq. |7j|. Error bars are smaller than the 
symbols. 

The 1-D two-particle pseudorapidity correlation functions, R(Arj), where A(p was averaged 
over the entire range from to n, are shown for all three energies in Fig. |4] 

In the context of an ICM description, R(Atj) can be parametrized using the functional form 0: 

T(Ajy) 



R(Af]) 



(K(K— 1)) 

where the correlation strength a = v ^ ;/ depends on the average numbers of particles into 
which a cluster decays, the cluster size K. The function T(At]) is a Gaussian function propor- 



B(At]) 



(7) 
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6 Short-Range Correlations in 0.9, 2.36, and 7 TeV Data 
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Figure 5: (a) K e g and (b) 5 as a function of ys, measured for pj > 0.1 GeV/c and \rj\ < 2.4 by 
CMS in solid circles. Open circles show the PYTHIA results with the D6T tune. 



tional to 



exp[-{At]) 2 /(4S 2 )} 



where 5 quantifies the average spread of particles originating from a single cluster, i.e. the decay 
width. The background distribution, B(Atj), in Eq. ^ is the same event-mixed distribution 
defined in Eq. ^ but averaged over all the multiplicity bins with all corrections applied, and 
integrated over Acp. 

Without knowing o~k, the width of the distribution of K, it is impossible to calculate the average 
cluster size (K) directly from the measured value of oc. However, an effective cluster size can 
be defined using the extracted correlation strength via the relation: 



K e ff = a. + 1 



(*(*-!)) 
(K) 



+ 1 = (K) + 



(8) 



The effective cluster size K e g and decay width 5 can be estimated by means of a least x 2 fit of 
Eq. j7j| to the measured two-particle pseudorapidity correlation function. The ICM provides a 
good fit to the data over a large range in Arj, as shown in Fig. [4] 



The statistical uncertainties of the fit parameters are much smaller than the systematic ones. 
The correction for event selection efficiency (see Section |5.2| has an overall systematic uncer- 
tainty of less than 2.8% found by comparing the result at the generator level to that from the 
reconstructed tracks after corrections. The model dependence of this procedure (i.e. the se- 
lection efficiency for NSD events) was investigated by using correction factors derived from 



9 



different MC generators such as PYTHIA, PHOJET [2% and HERWIG++ [30J. The biggest 
discrepancy in the final results was about 2.6%. 

Systematic uncertainties due to track quality cuts were examined by loosening the cuts on 
the significance of both the transverse track impact parameter, d X y/a{d xy ), and the distance 
along the beam to the primary vertex, d z / a(d z ) from 3 to 5. The final results were found to be 
insensitive to these track selections to within 1.2%. 

A summary of systematic uncertainties for the inclusive analysis is given in Table [2] The un- 
certainties are presented for the cluster model fit parameters listed in Table |2j namely the cor- 
relation strength (a = K e ^ — 1) and the width in pseudorapidity (8). 

Table 2: Summary of systematic uncertainties in the inclusive analysis. 



Systematic uncertainties [%] 



Source 


Keif ~ 1 («) 


8 


Track quality cuts 


1.2 


1.0 


Correction for tracking/ acceptance efficiency and fake rate 


1.3 


1.4 


Correction for event selection efficiency 


2.6 


2.8 


Model dependence of the corrections 


2.6 


1.3 


Total systematic uncertainties 


4.1 


3.5 



Values of effective cluster sizes and widths observed within the kinematic cuts on pj and \t]\ 
are summarized in Table HI 

Table 3: Final results on K e {{ and 8 measured within the kinematic cuts of pj > 0.1 GeV/c and 
|jy| < 2.4 at CMS. 







§\fl\<2A 


0.9 TeV 
2.36 TeV 
7 TeV 


2.12± < 0.01 (stat.) ± 0.05 (syst.) 

2.23 ± 0.02 (stat.) ± 0.05 (syst.) 
2.34± < 0.01 (stat.) ± 0.06 (syst.) 


0.53± < 0.01 (stat.) ± 0.02 (syst.) 
0.52± < 0.01 (stat.) ± 0.02 (syst.) 
0.51± < 0.01 (stat.) ± 0.02 (syst.) 



As can be seen in Fig. |4j the most central point of the 1-D pseudorapidity correlation func- 
tion always lies slightly above the fits. This could be due to the residual effects of secondary 
processes that were not fully removed by the track selection, as well as BEC or other physics 
processes at this small scale in Arj. All fits exclude this central point, but including it in the fit 
affects the values of K e ff and 8 by no more than 0.5%. 

In Fig. |5j CMS measurements of K e ff and 8 for pj > 0.1 GeV/c and \t]\ < 2.4 are shown as 
functions of \/s, and compared with the PYTHIA D6T tune. An energy dependence of K e tf is 
observed, while 8 remains roughly constant over the three energies. PYTHIA shows energy 
dependencies of K e a and 8 similar to those seen in the data, but the magnitude of K e g is signif- 
icantly smaller in PYTHIA. The effect of tensor mesons in PYTHIA was investigated, but even 
using an unrealistically large probability of 50% for the angular momentum L=l meson states 
accounted for only about one third of the difference. Results from the HERWIG++ model were 
also studied and found to have correlation function shapes dramatically different from the 
data, in agreement with previous results [31 j showing that HERWIG++ is insufficiently tuned 
to reproduce soft QCD processes. 
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7 Long-Range Correlations in 7 TeV Data 



Table 4: Final results on K e f{ and 5 measured by CMS after extrapolation to pj > and \rj\ < 3. 
The third quoted uncertainty is due to the extrapolation procedure. 





r \V\<3.0 
K eff 




0.9 TeV 
2.36 TeV 
7 TeV 


2.50± < 0.01 (stat.) ± 0.06 (syst.) ± 0.07 
2.65 ± 0.03 (stat.) ± 0.07 (syst.) ± 0.08 
2.75± < 0.01 (stat.) ± 0.07 (syst.) ± 0.09 


0.64± < 0.01 (stat.) ± 0.02 (syst.) ± 0.03 
0.60 ± 0.01 (stat.) ± 0.02 (syst.) ± 0.03 
0.59± < 0.01 (stat.) ± 0.02 (syst.) ± 0.03 



In order to compare with measurements made at lower energies, CMS results were extrapo- 
lated to 1 77 1 < 3 and the full pj range pj > to achieve a consistent kinematic range. The 
fraction of tracks below pj 0.1 GeV/c was estimated by fitting the measured pj distributions 
using the Tsallis function as was done in Ref. |2T|, which empirically describes both the low 
pr exponential and the high pj power-law behaviors II32I . The integral of the fit function for 
pj < 0.1 GeV/c amounts to about 5.5% of the total yield, consistent with the results in Ref. pl|. 
As first quantified in Ref. [7], the loss of particles falling outside a limited tj acceptance results 
in a significant reduction of both Keg and 5. This effect was investigated using several dynami- 
cal models as well as the simple ICM following the identical approach used in Ref. 0. As was 
the case in the previous analysis, the ratios of K and 3 for different 77 acceptances (\tj\ < 3.0 and 
\rj I < 2.4 in the present work) were found to scale very closely with S^ <2A , the measured clus- 
ter width using data in the \t]\ < 2.4 region, reinforcing the conclusion that the dependence of 
the extracted cluster parameters on pseudorapidity acceptance is primarily a simple geometric 
effect. 

Figure [6] shows the results of K e ff and 5 measured by the CMS experiment after the extrapola- 
tion to 1 77 1 < 3 and pj ~ 0, as well as previous measurements at lower energies in the same 
pseudorapidity range (3j|5H6l. Values of the extrapolated CMS results are summarized in Ta- 
ble |3J where the third quoted uncertainty is due to the extrapolation. The error bars in Fig. [6] 
include the systematic uncertainties from both the experimental measurements and the extrap- 
olations added in quadrature. Events generated with PYTHIA D6T tune show a similar energy 
dependence of K e ft and 5 as the data, but systematically underestimate the magnitude of K e ft 
over the full energy range. 

The observed cluster size cannot be fully explained by a resonance decay model even at very 
low energies, since the expectation of (K) from resonance decays is about 1.5 (extrapolating 
to 1.7 for K e ff depending on the assumed cluster size distribution ||5l). This is significantly 
lower than the observed values, but is close to what is seen in PYTHIA. Additional sources of 
pseudorapidity correlations, such as local quantum number conservation 11331 , are needed to 
describe the data. As the energy increases further (especially at the TeV scale), the onset of jets 
should play a more important role in the particle production, resulting in bigger clusters. This 
effect could be the underlying cause for the observed energy dependence of K e y . 

7 Long-Range Correlations in 7 TeV Data 

The study of long-range azimuthal correlations involved generating 2-D At]-A(p distributions 
in bins of event multiplicity and particle transverse momentum. The analysis procedure was 
to a large extent identical with that used for the minimum bias data described in Section |4] 
With the addition of pr binning, both particles in the pairs used to calculate R{Arj,A(p) were 
required to be within the selected pj range. The events were divided into bins of offline track 
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Figure 6: (a) K e n and (b) 5 as a function of y/s based on a model-dependent extrapolation 
of CMS data to pj ~ and |?/| < 3 (solid circles), as well as data from PHOBOS |6| (solid 
squares), UA5 |5| (solid triangles) and ISR |3| (solid stars) experiments for pp and pp collisions. 
Open circles and squares show the PYTHIA results for the D6T tune and default parameters, 
respectively. 

multiplicity as outlined in Table [T] In order to reach good statistics for the highest attainable 
charged particle densities, only data at 7 TeV were considered. 

Figure [7] compares 2-D two-particle correlation functions for minimum bias events and high 
multiplicity events, for both inclusive particles and for particles in an intermediate pj bin. The 
top two panels show results from minimum bias events. The correlation function for inclusive 
particles with pj > 0.1 GeV/c shows the typical structure as described by the independent 
cluster model. The region at At] «0 and intermediate Acp is dominated by particle emission 
from clusters with low transverse momentum, with some contribution from jet-like particle 
production near (Arj,A(p) (0,0) due to near-side jet fragmentation and a broad elongated 
ridge around Acp n due to fragmentation of back-to-back jets. Also visible is a shallow 
minimum at Acp « at large | Atj \ due to momentum conservation. For the intermediate pj 
region of 1 GeV/c < pj < 3 GeV/c a more pronounced near-side jet peak and away-side ridge 
are visible, due to the enhanced contribution of jet fragmentation to particle production for 
increasing pj. 

For ^-integrated two-particle correlations in high multiplicity events (N°^ ine > 110, Fig. J^:), 
most correlation structures are similar to those for minimum bias events. The cut on high mul- 
tiplicity enhances the relative contribution of high pj jets which fragment into a large number 
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7 Long-Range Correlations in 7 TeV Data 



(a) CMS MinBias, P T >0.1GeV/c (b) CMS MinBias, 1 .0GeV/c<p T <3.0GeV/c 





Figure 7: 2-D two-particle correlation functions for 7 TeV pp (a) minimum bias events with 
Pt > 0.1GeV/c, (b) minimum bias events with 1 < pj < 3GeV/c, (c) high multiplicity 
^offline > 11Q ) events with pr > o.lGeV/c and (d) high multiplicity (N^ e > 110) events 
with 1 < pj < 3GeV/c. The sharp near-side peak from jet correlations is cut off in order to 
better illustrate the structure outside that region. 

of particles and, therefore, has a qualitatively similar effect on the shape as the particle pj cut 
on minimum bias events (compare Fig. and Fig. |7j;). However, it is interesting to note that 
a closer inspection of the shallow minimum at A(p « and \At]\ > 2 in high multiplicity pj- 
integrated events reveals it to be slightly less pronounced than that in minimum bias collisions. 

Moving to the intermediate pj range in high multiplicity events shown in Fig. [TJi, an unex- 
pected effect is observed in the data. A clear and significant "ridge"-like structure emerges 
at A<p « extending to |Aj/| of at least 4 units. This is a novel feature of the data which has 
never been seen in two-particle correlation functions in pp or pp collisions. Simulations using 
MC models do not predict such an effect. An identical analysis of high multiplicity events in 
PYTHIA8 [34J results in correlation functions which do not exhibit the extended ridge at Acp 
seen in Fig. [7ji, while all other structures of the correlation function are qualitatively repro- 
duced. PYTHIA8 was used to compare to these data since it produces more high multiplicity 
events than PYTHIA6 in the D6T tune . Several other PYTHIA tunes, as well as HERWIG++ £50 
and Madgraph 11351 events were also investigated. No evidence for near-side correlations cor- 
responding to those seen in data was found. 

The novel structure in the high multiplicity pp data is reminiscent of correlations seen in rel- 
ativistic heavy ion data. In the latter case, the observed long-range correlations are generally 



13 



0.1GeV/c<p T <1.0GeV/c 
• CMSpp 
— PYTHIA8 



1.0GeV/c<p T <2.0GeV/c 



2.0GeV/c<p T <3.0GeV/c 
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Figure 8: Projections of 2-D correlation functions onto Acp for 2.0 < \Atj\ < 4.8 in different pj 
and multiplicity bins for fully corrected 7 TeV pp data and reconstructed PYTHIA8 simulations. 
Error bars are smaller than the symbols. 

assumed to arise from various components of hydrodynamic flow of the produced medium 
||9l l36l - l39ll , from interactions between hard scattering processes and the medium, and from col- 
lective effects in the initial interaction of the nuclei. 

However, new correlations can also start to emerge in the new energy regime probed here due 
to more elementary processes. For example, long range correlations are predicted also to occur 
in systems with a large number of fluctuating components, e.g. originating from additional 
color string connections. Such effects are presently not modeled in the MC generators. 

Compared to the minimum bias analysis, the online and offline event selection of the rare high 
multiplicity events eliminated some sources of systematic uncertainties, but also introduced 
several additional ones. The bias due to the selection efficiency for NSD events, and its associ- 
ated correction, were not an issue for the high multiplicity analysis since the efficiency reaches 



100% as discussed in Section 5.2 However, it was necessary to correct for the inefficiency in the 
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7 Long-Range Correlations in 7 TeV Data 



HLT selection shown in Fig. [T] Comparison of correlation functions for the high multiplicity 
bin, N°f ine > 110, taken from the two different trigger paths, (N° r £ line > 70 and N°£ line > 85, 
see Section [53]> showed a systematic variation of 4 to 5%. 



The pile-up rate (fraction of events with more than one good offline vertex found) reached 
about 40% for the LHC conditions pertaining for most of the high multiplicity data-taking. 
Studies on correlations between two offline vertices in each event showed that about 10% of 
the events contained pile-up that could not be distinguished by the vertex finding algorithm. 
Therefore, a high multiplicity event could be faked by a pile-up of several minimum bias col- 
lisions with very close vertex positions. Although such pile-up of independent pp collisions 
is not expected to generate additional correlations in this analysis, a data-driven limit on the 
effect of pile-up events was established. This was based on a comparison of results from runs 
with negligible pile-up collected with lower instantaneous luminosity to results obtained with 
high luminosity data at nominal bunch intensity. This comparison was limited by the size of 
the event sample collected for low luminosity conditions in early LHC running. For all lumi- 
nosity selections, the near-side ridge signal was observed and a conservative systematic error 
of 15%, which covers the difference over all run periods, was assigned. 
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Figure 9: Associated yield for the near-side of the correlation function integrated over the 
region of 2.0 < | Ar] | < 4.8 as a function of event multiplicity in bins of pj for 7 TeV pp collisions. 
The error bars correspond to statistical errors, while the brackets around the data points denote 
the systematic uncertainties. The open squares show results for PYTHIA8. 

In order to investigate the turn-on behavior of the "ridge"-like structure quantitatively and in 
finer detail, correlation functions were obtained in four bins of charged particle multiplicity and 
four bins of particle transverse momentum. To study the long-range azimuthal correlations, the 
1-D Acp were calculated by integrating over the 2.0 < \Arj\ < 4.8 region. Figure [8] shows the 
results for a range of pj (from left to right) and multiplicity (from top to bottom) bins. CMS 
data are shown as solid circles and the lines show PYTHIA8 results. In this projection, only the 
range of < Acp < n is shown, as the Acp correlation function is symmetric around A<p = by 
construction. All panels show the away-side jet contribution at Acp « n. In addition, for high 
multiplicity bins in the intermediate pj region, 1 < pj < 3 GeV/c, a second local maximum 
near Acp « is clearly observed. This new feature of the long-range azimuthal correlation 
function is not present in low multiplicity or minimum bias data, which are dominated by the 
low multiplicity events. 

The comparison of data to PYTHIA8 simulations is characterized by two discrepancies: The 
strength of the away-side correlation is over- or underpredicted for almost all bins. This quan- 
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titative discrepancy could be remedied by further tuning of the relative contributions of di-jet 
and multi-jet processes compared to particle production from soft processes in the model with- 
out introducing a qualitatively new mechanism. More importantly PYTHIA8 qualitatively 
fails to reproduce the novel local maximum near Acp « in any of the fj or multiplicity bins. 
It appears that soft particle production from string fragmentation, the contribution from jet 
fragmentation, final-state radiation, and concurrent semihard multi-parton interactions, to the 
extent they are parametrized in PYTHIA8, do not provide a mechanism to create the observed 
long-range, near-side particle correlations. 
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Figure 10: Like-sign and unlike-sign associated yield for the near-side of the correlation func- 
tion integrated over the region of 2.0 < \Atj\ < 4.8 as a function of event multiplicity in bins 
of pj- The error bars correspond to statistical errors, while the brackets around the data points 
denote the systematic uncertainties. 

Figure [H] shows that the long-range, near-side correlation increases in strength with increasing 
multiplicity and is most prominent in the region of 1 < pj < 3 GeV/c. The strength of the 
near-side ridge and its dependence on pj and multiplicity can be quantified in more detail by 
calculating the associated yield, i.e., the number of other particles correlated with a specific 
particle. In the presence of multiple sources of correlations, the yield for the correlation of in- 
terest is commonly estimated using an implementation of the zero-yield-at-minimum (ZYAM) 
method [40 1. The procedure uses R(A</>) integrated over 2.0 < \At]\ < 4.8 as shown in Fig. [8] 
In the first step, a second-order polynomial is fit to R(A</>) in the region 0.1 < \A(p\ < 2.0. The 
location of the minimum of the polynomial in this region is denoted Ai^zyam- The contribu- 
tion from the background source of correlations, in this case the away-side jet correlations, is 
assumed to be zero for \A(p\ < A^zyam- Using the position of the minimum, the associated 
yield is then found by integrating R(Acp) over the region < \A<p\ < A</>zyam relative to the 
minimum in R(A</>) and multiplying by J 2 4 Q 8 B(Af/)d|A?/| to account for the fact that only a lim- 
ited At] range is used. The uncertainty on the minimum level of R(A<p) obtained by the ZYAM 
procedure as well as varying the fit range in A<p gives an uncertainty of 0.0025 on the associated 
yield, uniformly over all multiplicity and pj bins. 

Figure [9] shows the associated yield as a function of event multiplicity integrated over 2.0 < 
\Arj\ < 4.8 in increasing bins of pj. The ridge yield is consistent with zero for low multiplic- 
ity events. The emergence of the ridge is observed toward the very high multiplicity region, 
primarily for the intermediate pj range of 1 — 3GeV/c. The error bars correspond to statistical 
errors, while the brackets around the data points denote the systematic uncertainties. Results 
from the PYTHIA8 MC, shown in open squares in Fig. [9} are consistent with zero for all multi- 
plicity and pr regions, indicating that the ridge observed in the data is totally absent in events 
produced by this generator. 
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8 Conclusion 



To investigate the novel ridge-like structure further, two-particle correlations were calculated 
separately for like-sign and unlike-sign charged pairs. Possible problems related to the track 
reconstruction algorithm, like multiple reconstruction of the same particle or local occupancy 
changes, would be expected to affect like-sign pairs differently than unlike-sign pairs. The 
same choice of pairs of like- or unlike-sign was made for both the signal and background in 
Eq. |T]). Figure 10 shows the associated yield for like-sign (solid circles) and unlike-sign (open 



squares) two-particle correlations respectively as a function of event multiplicity integrated 
over 2.0 < \Aij\ < 4.8 in bins of fj. Consistent multiplicity and pj dependencies of the near- 
side associated yield are observed for charge dependent and charge independent correlations. 
The results for like-sign and unlike-sign pairs agree with each other within uncertainties. Since 
the number of like- and unlike-sign pairs each represent roughly half of the total, the yield of 
associated pairs counting only one sign option is expected to be roughly a factor of two smaller 
than the unrestricted yield. 

As a further cross-check, correlation functions were generated for tracks paired with ECAL 
photons (primarily due to 7T°s) as well as pairs of two ECAL photons. These distributions 
showed similar behavior to those shown in Figs. [7] and [8j i.e., the high \Ar/\ region contained a 
dip at | A(p\ in minimum bias events and a ridge in that region for high multiplicity events. 
Data at 0.9 and 2.36 TeV were also analyzed for long-range correlations, but the statistics were 
not sufficient to draw a conclusion. 



8 Conclusion 

The CMS detector at the LHC has been used to measure angular correlations between two 
charged particles up to | At] | 5 and over the full range of Acp in pp collisions at \fs = 0.9, 2.36, 
and 7 TeV. The extracted 2-D correlation functions show a variety of features. In minimum bias 
collisions they are dominated by a maximum at Arj = extending over the full range in Acp, 
with a width which tends to increase with increasing Acp. A simple cluster model parametriza- 
tion was fit to these short-range correlations in order to quantify their strength (the effective 
cluster size) and their extent in relative pseudorapidity (the cluster decay width). The cluster 
size is observed to increase slowly with beam energy, while the cluster width is essentially con- 
stant. The PYTHIA event generator with D6T tune correctly describes the cluster widths and 
the energy dependence of the cluster size but systematically underestimates the cluster size. 

Long-range azimuthal correlations for 2.0 < | Arj \ < 4.8 have been studied for 7 TeV data, lead- 
ing to the first observation of a long-range ridge-like structure at the near-side (Acp 0) in pp 
collisions. This striking feature is clearly seen for large rapidity differences \Arj\ > 2 in events 
with an observed charged particle multiplicity of N ~ 90 or higher. The enhancement in the 
near-side correlation function is most evident in the intermediate transverse momentum range, 
1 < pj < 3 GeV/c. In the 2.0 < \Atj\ < 4.8 range, a steep increase of the near-side associated 
yield with multiplicity has been found in the data, whereas simulations show an associated 
yield consistent with zero, independent of multiplicity and transverse momentum. The novel 
structure resembles similar features observed in heavy ion experiments 0-(9l. However, the 
physical origin of our observation is not yet understood. Additional characteristics of the high 
multiplicity pp events displaying this novel feature deserve further detailed study. 
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